A universal table model for categorical databases
نویسنده
چکیده
A consistent categorical database can be viewed as a single contingency table by taking the maximum-entr0p.y extension of its base tables. Such a view, here called the universaf table model, is needed to answer a user who wishes “cross-classified” data, that is, categorical data resulting from the combination of information contained in two or more base tables. In order to implement a universal table interface, we make use of a query-evaluation procedure; this allows for an appropriate answer to be generated whether the requested data are stored in the database or not and, then, have to be computed (i.e., estimated),
منابع مشابه
General Table Completion using a Bayesian Nonparametric Model
Even though heterogeneous databases can be found in a broad variety of applications, there exists a lack of tools for estimating missing data in such databases. In this paper, we provide an efficient and robust table completion tool, based on a Bayesian nonparametric latent feature model. In particular, we propose a general observation model for the Indian buffet process (IBP) adapted to mixed ...
متن کاملTable manipulation in simplicial databases
In [Spi], we developed a category of databases in which the schema of a database is represented as a simplicial set. Each simplex corresponds to a table in the database. There, our main concern was to find a categorical formulation of databases; the simplicial nature of the schemas was to some degree unexpected and unexploited. In the present note, we show how to use this geometric formulation ...
متن کاملConceptual and Visual Models for Categorical Data
A dynamic conceptual model for categorical data is described that likens observations to gas molecules in a pressure chamber. In this physical model frequency corresponds to pressure, and fitting a statistical model by maximum likelihood corresponds to minimizing energy or balancing of forces. The model provides neat explanations of many results for categorical data, extends readily to multiway...
متن کاملBayesian inference for square contingency tables
Inference for multivariate categorical data often proceeds by selecting a log-linear model from a set of competing models or, in a Bayesian approach, by averaging inferences over the set, weighted by posterior probabilities. In this paper, we use permutation invariance as a criterion for constructing a set of models for this purpose, for the common situation when the data form a ‘square’ contin...
متن کاملCategorical Data Analysis
This course introduces principles and analyses related to data with categorical outcomes. This course will consider topics such as probability distributions with categorical data, contingency table analysis, the generalized linear model, logit models and loglinear models. Students are expected to: a) learn to select methods appropriate for a question of interest for data with a categorical outc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Sci.
دوره 49 شماره
صفحات -
تاریخ انتشار 1989